Introduce a process-wide singleton engine for `.collect(engine="gpu")` by madsbk · Pull Request #22410 · rapidsai/cudf

madsbk · 2026-05-07T16:55:47Z

lf.collect(engine="gpu") and pl.GPUEngine(executor="streaming") using the default cluster now route through a new process-wide DefaultSingletonEngine instead of constructing a fresh rapidsmpf Context, RMM adaptor, and Python executor for every query. Bootstrap now happens once per process rather than once per query.

DefaultSingletonEngine is a process-wide single-GPU singleton specialization of SPMDEngine: at most one live instance exists per process, it always uses a single-rank communicator plus default environment-derived settings, and repeated calls reuse the same engine instance until explicit shutdown.

The default cluster enum value is renamed from Cluster.SINGLE to Cluster.DEFAULT_SINGLETON so the dispatch token better reflects the actual behavior.

This PR also removes the dead inline-context fallback in evaluate_pipeline, which was the original "single" execution path.

madsbk · 2026-05-09T08:13:44Z

-    Because each call forks a new child, process-wide side-effects
-    (the ``_bind_done`` flag, CPU affinity, environment variables) never
-    leak between tests or back into the pytest process.
+def _run_in_subprocess(target: Callable[[], None]) -> None:


This PR exposed some issues with the "fork" approach, so we now use "spawn" instead. Otherwise, the tests remain the same.

Co-authored-by: Lawrence Mitchell <wence@gmx.li>

TomAugspurger

Partial review. I'll try to get back to this later, but don't wait for me.

TomAugspurger · 2026-05-11T15:44:59Z

-      streaming runtime.
-    * ``Cluster.DASK`` : Multi-GPU execution via Dask workers and the rapidsmpf
-      streaming runtime.
+    * ``Cluster.DEFAULT_SINGLETON`` : Single-GPU execution via the DefaultSingletonEngine.


Let's confirm this is the name we want.

if we ever change the default, then this name will become misleading

How commonly understood is "singleton"?

This name doesn't mention "single" GPU-only, though the docs do.

And given that this is the default... maybe we can get away with updating the call sites to cluster: Cluster | None? And if we encounter None then we set up the default singleton cluster, and so we don't even need an enum name for this thing?

I guess you are right, but I still prefer the more descriptive name. I like DEFAULT_SINGLETON and DefaultSingletonEngine because that is exactly what they are :)

If we ever decide to change the default implementation, I think we should change what DefaultSingletonEngine does internally rather than route the default path to an entirely different engine type.

To me, the important semantic is “process-wide implicit singleton default engine”, and the current name makes that very explicit.

wence-

Tiny suggestions

…/default_singleton_engine.py Co-authored-by: Lawrence Mitchell <wence@gmx.li>

madsbk · 2026-05-12T12:38:05Z

/merge

rapidsai#22410) `lf.collect(engine="gpu")` and `pl.GPUEngine(executor="streaming")` using the default cluster now route through a new process-wide `DefaultSingletonEngine` instead of constructing a fresh rapidsmpf `Context`, RMM adaptor, and Python executor for every query. Bootstrap now happens once per process rather than once per query. `DefaultSingletonEngine` is a process-wide single-GPU singleton specialization of `SPMDEngine`: at most one live instance exists per process, it always uses a single-rank communicator plus default environment-derived settings, and repeated calls reuse the same engine instance until explicit shutdown. The default cluster enum value is renamed from `Cluster.SINGLE` to `Cluster.DEFAULT_SINGLETON` so the dispatch token better reflects the actual behavior. This PR also removes the dead inline-context fallback in `evaluate_pipeline`, which was the original `"single"` execution path. Authors: - Mads R. B. Kristensen (https://github.com/madsbk) Approvers: - Lawrence Mitchell (https://github.com/wence-) URL: rapidsai#22410

madsbk self-assigned this May 7, 2026

madsbk added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels May 7, 2026

github-actions Bot added Python Affects Python cuDF API. cudf-polars Issues specific to cudf-polars labels May 7, 2026

github-project-automation Bot added this to cuDF Python May 7, 2026

GPUtester moved this to In Progress in cuDF Python May 7, 2026

madsbk force-pushed the default_gpu_engine branch 9 times, most recently from 7e6beeb to 0fb9fe8 Compare May 9, 2026 08:07

madsbk commented May 9, 2026

View reviewed changes

Comment thread python/cudf_polars/cudf_polars/experimental/rapidsmpf/collectives/sort.py

madsbk commented May 9, 2026

View reviewed changes

Comment thread python/cudf_polars/tests/experimental/test_dataframescan.py

madsbk force-pushed the default_gpu_engine branch from 0fb9fe8 to e7fe81a Compare May 9, 2026 08:17

DefaultSingletonEngine

34498a9

madsbk force-pushed the default_gpu_engine branch from e7fe81a to 34498a9 Compare May 9, 2026 08:17

madsbk added breaking Breaking change and removed non-breaking Non-breaking change labels May 9, 2026

Remove the legacy "single" path

f2fa352

madsbk force-pushed the default_gpu_engine branch from 770331a to f2fa352 Compare May 9, 2026 12:28

madsbk marked this pull request as ready for review May 9, 2026 13:59

madsbk requested a review from a team as a code owner May 9, 2026 13:59

madsbk requested a review from mroeschke May 9, 2026 13:59

rapidsai deleted a comment from copy-pr-bot Bot May 9, 2026

wence- reviewed May 11, 2026

View reviewed changes

madsbk and others added 5 commits May 11, 2026 15:40

Apply suggestions from code review

2f7b115

Co-authored-by: Lawrence Mitchell <wence@gmx.li>

Merge branch 'main' of github.com:rapidsai/cudf into default_gpu_engine

432e590

remove delattr(dask_worker, attr) guard

c5ee511

cleanup

22312a2

Merge branch 'main' of github.com:rapidsai/cudf into default_gpu_engine

806cf80

madsbk requested a review from wence- May 11, 2026 14:57

TomAugspurger reviewed May 11, 2026

View reviewed changes

mroeschke reviewed May 11, 2026

View reviewed changes

Comment thread python/cudf_polars/cudf_polars/experimental/rapidsmpf/frontend/default_singleton_engine.py

Comment thread python/cudf_polars/tests/experimental/test_default_singleton_engine.py

madsbk added 6 commits May 11, 2026 19:33

moved check_no_live_default_singleton() call inside StreamingEngine

4a1903a

rename to get_or_create()

f2687e3

test collect(engine=gpu")

138af2d

Merge branch 'main' of github.com:rapidsai/cudf into default_gpu_engine

108c019

Merge branch 'main' of github.com:rapidsai/cudf into default_gpu_engine

5d7803e

Merge branch 'main' of github.com:rapidsai/cudf into default_gpu_engine

3b9e5c0

madsbk requested review from TomAugspurger and mroeschke May 12, 2026 05:52

wence- approved these changes May 12, 2026

View reviewed changes

Comment thread python/cudf_polars/cudf_polars/experimental/rapidsmpf/frontend/default_singleton_engine.py Outdated

Comment thread python/cudf_polars/cudf_polars/experimental/rapidsmpf/frontend/default_singleton_engine.py Outdated

madsbk and others added 2 commits May 12, 2026 02:05

Update python/cudf_polars/cudf_polars/experimental/rapidsmpf/frontend…

da4c0c4

…/default_singleton_engine.py Co-authored-by: Lawrence Mitchell <wence@gmx.li>

docs

6ad85b1

rapids-bot Bot merged commit 57e27e7 into rapidsai:main May 12, 2026
89 checks passed

github-project-automation Bot moved this from In Progress to Done in cuDF Python May 12, 2026

madsbk deleted the default_gpu_engine branch May 12, 2026 12:39

coderabbitai Bot mentioned this pull request May 12, 2026

Move replicated-output dedup to the Dask and Ray frontends #22394

Merged

3 tasks

This was referenced May 13, 2026

Cleanup the legacy engines code path #22488

Draft

Reduce peak footprint of cudf-polars test memory usage #22493

Merged

Conversation

madsbk commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

madsbk May 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TomAugspurger left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

TomAugspurger May 11, 2026

Choose a reason for hiding this comment

Uh oh!

madsbk May 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wence- left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

madsbk commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

madsbk commented May 7, 2026 •

edited

Loading